Machine or Human? Evaluating the Quality of a Language Translation Mobile App for Diabetes Education Material
نویسندگان
چکیده
Background: Diabetes is a major health crisis for Hispanics and Asian Americans. Moreover, Spanish and Chinese speakers are more likely to have limited English proficiency in the United States. One potential tool for facilitating language communication between diabetes patients and health care providers is technology, specifically mobile phones. Objective: Previous studies have assessed machine translation quality using only writing inputs. To bridge such a research gap, we conducted a pilot study to evaluate the quality of a mobile language translation app (iTranslate) with a voice recognition feature for translating diabetes patient education material. Methods: The pamphlet, “You are the heart of your family...take care of it,” is a health education sheet for diabetes patients that outlines three recommended questions for patients to ask their clinicians. Two professional translators translated the original English sentences into Spanish and Chinese. We recruited six certified medical translators (three Spanish and three Chinese) to conduct blinded evaluations of the following versions: (1) sentences interpreted by iTranslate, and (2) sentences interpreted by the professional human translators. Evaluators rated the sentences (ranging from 1-5) on four scales: Fluency, Adequacy, Meaning, and Severity. We performed descriptive analyses to examine the differences between these two versions. Results: Cronbach alpha values exhibited high degrees of agreement on the rating outcomes of both evaluator groups: .920 for the Spanish raters and .971 for the Chinese raters. The readability scores generated using MS Word’s Flesch-Kincaid Grade Level for these sentences were 0.0, 1.0, and 7.1. We found iTranslate generally provided translation accuracy comparable to human translators on simple sentences. However, iTranslate made more errors when translating difficult sentences. Conclusions: Although the evidence from our study supports iTranslate’s potential for supplementing professional human translators, further evidence is needed. For this reason, mobile language translation apps should be used with caution. (JMIR Diabetes 2017;2(1):e13) doi:10.2196/diabetes.7446
منابع مشابه
The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language
Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...
متن کاملInvestigating the Quality of Persian Mobile Applications Related to Patients with Chronic Diseases
Introduction: Today, the use of mobile applications to help self-care in patients with chronic diseases has increased. The objective of this study was to investigate the quality of Persian mobile applications related to patients with diabetes and hypertension. Method: This analytical study was conducted on all Persian mobile applications related to diabetes and hypertension in 2019. The mobile ...
متن کاملInvestigating the Quality of Persian Mobile Applications Related to Patients with Chronic Diseases
Introduction: Today, the use of mobile applications to help self-care in patients with chronic diseases has increased. The objective of this study was to investigate the quality of Persian mobile applications related to patients with diabetes and hypertension. Method: This analytical study was conducted on all Persian mobile applications related to diabetes and hypertension in 2019. The mobile ...
متن کاملAgile Development of a Custom-Made Vocabulary Mobile Application: A Critical Qualitative Approach
There have been some observed studies and developed applications (apps), with a concentration on Mobile Assisted Language Learning (MALL), and no consideration of communicative needs of the learners; besides, these studies focused on either the theoretical aspects or the utilization of the available apps in the market (Burston & Athanasiou, 2020). Hence, Vocabulary Guru (VG), a custom-made mobi...
متن کاملTranslation Quality Assessment of English Equivalents of Persian Proper Nouns: A case of bilingual tourist signposts in Isfahan
Abstract This study evaluated the translation quality of English equivalents of Persian proper nouns in the tourist signs and bilingual boards in Isfahan. To find different errors in the translations of the bilingual boards and tourist signs, the data were collected directly by taking picture or writing exactly from the available tourist signs and bilingual boards. Then, the errors were assesse...
متن کامل